Corpus: fra_news_2005_300K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 94 97 98 98 98
1000 719 868 957 979 981
10000 6199 8651 9600 9868 9923
100000 33211 66276 88180 96312 98475
1000000 65810 161054 241637 278566 290420


Zipf's diagram for sentence endings


Gnuplot diagram

11913 msec needed at 2018-03-02 15:31